# GRPO Fine-tuning
GRPO VI Qwen2 7B RAG
Apache-2.0
Vietnamese Retrieval-Augmented Generation (RAG) specialized large model fine-tuned based on Qwen2.5-7B-Instruct, trained using GRPO optimization method
Large Language Model
Transformers Other

G
AITeamVN
402
11
Xiyansql QwenCoder 7B 2504
Apache-2.0
A fine-tuned SQL generation model based on QwenCoder, supporting multiple dialects with excellent performance
Text Generation
Safetensors Supports Multiple Languages
X
XGenerationLab
266
2
Nano Aha Moment 3b
A 3-billion-parameter language model trained with reinforcement learning for solving mathematical reasoning tasks, especially countdown games.
Large Language Model
Transformers

N
McGill-NLP
55
2
Gemma 3 4b Reasoning
Apache-2.0
Gemma-3-4b Reasoning is a Transformer-based language model fine-tuned using the GRPO method, specializing in reasoning task optimization.
Large Language Model
Transformers English

G
ericrisco
53
2
Medqwen3b Reasoner
Apache-2.0
A medical domain-specific model based on Qwen2.5-3B-Instruct, excelling in medical reasoning and mathematical problem-solving
Large Language Model English
M
hooman650
156
12
Featured Recommended AI Models